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ABSTRACT 


Linear  estimation,  theory  has  been  applied  extensively  to  nonlinear 
systems  by  assuming  that  perturbations  from  a  reference  solution  can 
be  described  by  linear  equations*  As  long  as  the  second  order  (and 
higher)  terms  in  the  perturbation  equations  are  negligible,  linear 
estimation  techniques  have  been  found  to  yield  satisfactory  response* 

Many  examples  have  been  encountered  in  which  the  linear  theory  is  not 
satisfactory,  however,  and  it  is  to  this  situation  that  attention  is 

directed  here*  Time-discrete  systems  in  which  the  second  order  effects 

% 

are  small  but  nonnegligible  are  considered.  Recursion  relations  for 
the  conditional  mean  and’ covariance  are  developed.  While  these  relations 
yield  approximations  to  the  true  values  of  these  moments,  they  are 
superior  to  the  approximations  provided  by  applying  linear  theory  to 
a  nonlinear  system*  Some  results  for  a  simple  system  are  presented 
in  which  the  response  from  linear  and  nonlinear  filters  is  compared. 
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»  * 

1.  INTRODUCTION  ~  ' 

•’  * 

Considerable  attention  has  been  devoted  to  the  problem  of  determining 

unbiased,  minimum  variance  estimates  of  the  state  of  linear  systems 
from  noisy  measurement  dat zt  For  linear  systems  with  additivo  white 
noise  the  solution,  frequently  referred  to  as  the  Kalman  filter,  is 
well-known  [l]  and  it  has  received  application  to  a  variety  of  engineering 
problems*  When  the  system  is  nonlinear,  linear  perturbation  theory  is 
introduced  and  the  linear  filter  is  applied  to  estimate  the  state 
perturbations.  This  has  been  found  to  provide  satisfactory  results  in 
most  cases  but  occasions  [2\  have  been  encountered  in  which  nonlinear 
effects  prove  to  have  a  very  deleterious  effect  upon  the  filter  response. 

In  the  succeeding  discussion  a  nonlinear  filter  is  presented  that  is 
appropriate  for  use  when  the  second  order  terms  are  small  but  nonnegligibie. 
Computational  results  indicate  that  a  significant  improvement  relative 
to  a  linear  filter  can  be  obtained  using  these  relations. 

A* considerable  portion  of  the  recent  research  into  nonlinear  filtering 
has  been  concerned  with  the  determination  of  the  a  posteriori  density 
function  of  the  state  conditioned  on  all  available  data.  Having  this 
density,  then  any  type  of  estimate  (e.g.  minimum  mean  square  error, 
minimum  absolute,  most  probable)  can  be  obtained,  theoretically.  Because 
of  the  difficult1 os  involved  in  using  the  general  results  that  have  been 
obtained,  attention  has  been  directed  to  methods  of  approximating  either 
the  density  or  its  moments.  This  approach  has  been  taken  by  Bass,  Norua 
and  Schwartz  [sj  for  continuous  systems.  They  approximate  the  true  a 

posteriori  density  by  assuming  that  it  is  represented  by  the  first  and 

\ 

second  order  moments  (all  higher  order  moments  are  negligible).  In  the 
investigations  presented  below,  time-discrete  systoms  are  considered 
and  the  a  posteriori  density  is  assumed  to  be  gaussian  and  recursion 
relations  for- the  mean  and  covariance  are  derived.  The  relations  that 
are  obtained  are  similar  to  the  relations  for  linear  systems  but 
incorporate  the  effect  of  second  order  terms  in  the  plant  and  measurement 
equations.  .  , 
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The  general  problem  and  its  solution  are  presented  and  discussed  in 
Section  2.  No  justification  for  the  results  are.  given  there.  This 
discussion  is  given  in  Section  3  for  a  scalar  system,  A  scalar  rather 
than  vector  system  is  considered  .because  the  notation  is  simpler  and 
thereby  avoids  uninteresting  complications  in  the  discussion.  The  * 
generalization  to  the  multidimensional  results  presented  in  Section  II 
is  straight  forward.  A  simple  numerical  example  is  discussed  in  Section 
IV  and  the  response  of  the  nonlinear  and  linear  filters  is  compared.  . 


II.  GENERAL  PROBLEM  AND  ITS  SOLUTION ■ 

The  state  of  a  dynamical  system  is  to  be  estimated  from  a  collection 
of  discrete  measurement  data  (z.  ...  z.  ).  The  evolution  of  the  state 

1  j  K 

is  assumed  to  be  described  by  a  system  of  nonlinear  difference  equations 


xk  *  pk  xk-l  +  sk*xk-l*  +  wk-l  *2,1) 

The  m-dimensional  measurement  data  z^  at  each  sampling  time  have  a  known 
nonlinear  relationship  with  the  state 


'n  “ H*  \  *  W  +  Tk 


(2.2) 


In  (2,1)  and  (2.2)  the  system  is  assumed  to  be  completely  known  except  for 

the  initial  state  xQ  and  the  plant  and  measurement  noise  sequences 

and  v,  , '  The  n  and  a-dimensional  vector  functions  g.  and  e.  are 
3  th  *  * 

known  and  are  such  that  the  i  component  is  given  by 

% 

Kfc  ®  xk-l  xk-i  ^ 


i  T  i 
ek  *  XR  Ek  xk 


(i  *  1|2,  ...  jia) 


l  * 

whero  the  and  are  symmetric  (n  x  n)  matrices.  The  Ffe  is  a  (n  x  n) 
matrix  and  the  is  a  (m  x  n)  matrix. 

The  initial  state  and  the  noise  sequences  are  assumed  explicitly  to  bo 
gaussian  and  to  be  mutually  uncorrclatod.  Further,  the  noise  sequences 


are  taken  to  be  uncorrelated  between  sampling  times  and  'shall  be 
referred  to  as  white  noise  sequences.  The  initial  state  has  the 
density 


p(xo)  *  ko  exp  -  |(x0-a)TM;1  (xo-a) 


and  at  each  sampling  time  the  r  >ise  samples  have  the  densities 


p(v3)  =  k  exp  -  i  **  Qj1 


.  ,  .  1  T  n-l 

»(riJ  *  kv  -  2  ri  *1  TJ 


C?.3) 


(2.4) 


(2.5) 


There  are  many  possible  types  of  estimates  that  could  be  considered  for 
this  problem.  Cox  [£3  has  considered  estimates  for  nonlinear  time-discrete 
systems  that  are  chosen  to  maximize  the  a  posteriori  density  function. 

The  conditional  mean  is  known  to  give  the  estimate  that  minimizes  the 
mean  square  error  and  this  estimate  is  considered  here.  Thus,  the 
discussion  will  be  directed  toward  approximating  the  mean  of  the  a 
posteriori  density  function  in  a  recursive  fashion. 

The  manner  in  which  the  a  posteriori  density  ptx^/z  )  evolves  from  one 
sampling  time  to  the  next  is  easily  .established.  Specifically  when  the 
noise  is  uncorrelated  between  sampling  times,  then  the  general  recursion 

J 

relations  are  given  by 


,  ,  k,  p(x  /*k)  pUkAk> 

and 

p(xk/zk"1)  =  j  P<*k_1/»k”1)  P< Vxk-l)dxk-l 
*  k*  1 

where  the  normalizing  constant  p(z^/z  )  is 

•j  , 

p(zk/zk-1)  a  j p(xk/zk"1)  p(zfe/xk)dXk 


(2.6) 


(2.7) 


(2.8) 


•  k  /  » 

The  notation  a  will  be  used  to  denote  the  set  (a1taot  ...  ,  a.) 
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,  •  a 

The  integrals  are  n-dimensic  \al  in  (2.7)  and  (2.8).  •  * 

The  density  >  p(x./ak  *)  described  by  (2.6)  will  be  referred  to  as  the 

*  If 

prediction  density  whereas  p(xk/z  )  will  be  designated  as  the  filtering 
density.  In  general  it  is  very  difficult  to  perform  in  a  closed  form 
the  operations  required  in  (p„7)  -  (2.8).,  Also,  it  is  difficult  to 
determine  the  moments  (specifically  the  mean)  from  (2.6)..  This  is 
discussed  in  Reference  5.  However,  if  the  moments  of  p(xk  j/z*”’1)  are 
known,  it  is  a  relatively  straight  forward  procedure  to  determine  the 
moments  of  the  prediction  density  p(xk/zk"1)  if  the  plant  noise  is 
additive  and  gaussian.  This  will  be  discussed  further  in  Section  III 
as  will  the  difficulties-  involved  in  obtaining  the  moments  of  p(x. /zk). 

K 

When  the  system  is  linear  (i.e.  gfc  and  ek  are  identically  zero  in  (2.1) 
and  (2.2)  ),  then  the  operations  in  (2.6)  —  (2.8)  can  be  carried  exactly 
and  are  known  to  be 


p<V*k-‘)  *  V-i'xp  -  t(V*kA-l)TfkA-l(VVk-i)  <2-9> 


and 


where 


.  kkexp  -  t<Vil/'k1<V4k> 


and 


Vk-l  *  *k  xk-l 

PkA-l  =  Fk  Pk-l<Fk  +  ^k-1 


K  ■  *kA-i  +  KkCVHk5kA-i] 


Kk  ■  pkA-i  HkPkpkA-i  Hk  +  “J"1 


(2.10) 

(2.11a) 

(2.11b) 

(2.12a) 

(2.12b) 


Pk  "  PkA-l  "  Kk  Hk  PkA-l 


(2.12c) 


Equations  (2.11)~  (2.12)  are  known  as  the  Kalman  filter  equations* 

These  relations  have  been. stated  for  ease  of  reference  for  the 
following  discussion. When  the  system  described  by  (2.1)  -  (2.2)  is 
considered,  it  is  no  longer  possible  to  obtain  the  exact  relations  for 
the  moments*  Because  of  the  nonlinearities, ,  the  prediction  and  filtering 
densities  will  lose  the  gaussian  character*  In  general,  these  densities 
will  lose  thoir  symmetric  properties  and  the  skewness  will,  be  nonzero.. 

This  means  that  the  maximum  point  of  the  density  will  no  longer  correspond 
to  the  conditional  mean, 

• 

If  the  nonlinearities  are  relatively  small,* then  the  skewness  will  be 

small  and  the  density  will  retain  an  essentially  gaussian  character. 

k*«l  k 

Supposing  that  the  p(x^/z  )  and  p(Xj */z  )  can  be  approximated  by  the 
gaussian  density  even  in  the  presence  of  .he  plant  and  measurement 
nonlinearities,  then  these  densities  will  be  described  by  (2.9)  -  (2*10). 
The  moments  *k’  no*  correspond  with  (2.11)  -  (2-12) 

in  this  case  because  of  the  g^  and  in  (2.1)  -  (2.2).  The  following 
relations  are  proposed  as  approximations  of  these  moments.  Their 

i  , 

derivation  is  discussed  in  Section  III. 


(2.13a) 

(2.13b) 
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For  the  filtering  density  the  moments  aro  approximated  by 

« 

.A  A  .  v  r  A 

xk  *  Vk-1  +  Kk  S.VzkA-iJ 
pk  a  pkA-l  “  Kk  tnk+2^  J  PkA-l 


(2.14a) 

(2.14b) 


where 


\  *  \a-i  (V2\)T  C(,!k*2  \>pkA-i  <V2^)I+V2AJ'1 


A*  «  , .  A  A 

ek/k~  l  c  dk'XkA-l  +  ek 

th  a 

The  i  component  of  e^  is 

*ek*i  3  xk/k~l  Ek  xkA-l  +  traCe  Ek  Pk-1 

and  the  i^  row  of  is 


<Vi 


*  T  Ei 

xkA-l  Ek 


,th 


The  ij  element  of  the  matrix  is 

■  ,<Vlj  ■  trace  <EipkA-i  'kVk-i* 


Note  that  if  there  are  no  nonlinear  «;erms,  then  (2.13)  and  (2,14) 
reduce  immediately  to  (2.1l)  -  (2,12)* 

✓ 

,  i  “  * 

It  is  interesting  to  exataine  these  relations  by  comparing  (2.13)  -  (2.14) 
with  (2,11)  -  (2.12).  First,  note  that  tht-  nonlinearities  have  the 
effect  of  modifying  the  "linear  transition  matrix  in  (2,13b)  and  the 
linear  observation  matrix  in  (2,14b).  In  tne  former  the  of  the 
linear  filter  is  replaced  by  (F^+2  (j^)  and  in  the  latter-  the  is 
modified  to  (n^+2^^).  This  provides  the  primary  influence  of  the 
nonlinearity  upon  the  covariance  matrix.  Further,  there  is  an  additional 
effect  which  can  be  viewed  as  an  increase  in  the  yJant  and  measurement 
noise  covariance  matrices.  The  plant  noise  covariance  is  increased 

to  <Qk  2+29^)  ana  the  measurement  noise  covariance  becomes  <V2V- 
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III.  DERIVATION  OF  THE  APPROXIMATING  MOMENTS 

•  % 

In  this  Section  the  derivation  of*  the  moments  stated  in  Section  11 
is  discussed  in  terras  of  a  scalar  system.  The  scalar  system  is 
discussed  here  in  order  to  simplify  the  notation*  The  extension  to 
the  general  multidimensional. results  is  straight  forward* 


Consider  the  prediction  density  first.  The  only  assumption  required  to 
establish  and  is  that  the  a  posteriori  density  ) 

is  gaussian  at  each  sampling  time.  The  scalar  version  of  (2.1)  can  be 
written  as 


*k  '  Vi  +  gk  Vi  +  Vi 


(3.1) 


The  density  for  the  noise  sequence  will  be  written  as 

p(v  k>  =  kw  «P  -  |  (-^) 

V  1 

The  moments  of  p(x./z  )  can  be  obtained  from  (2.7)  by  noting  that 


E  [  x^/zk"1J  =  Jx*  p(xk/zk”1)dxJ| 


Substituting  (2.7)  and  iterating  integrals,  one  obtains 

■  fdlt  '(VVi,aik 


(3.2) 


The  density  p(xk/xk  j)  is  known  from  (3,1)  and  (2.4).  It  is 
/  /  ,  v  1  /xk  "  fk*k-i  "  ek*k-l, 

*<vVi)  ■ k.  •*»  -  2 ( - - >  . 


Using  this  relation,  the  innermost  integral  in  (3.2)  is  easily  determined 
In  the  case  when  i=l,  the  mean  value  is 


IvVi] 


e  xk-l  +  skxk-l 


«  o  « 


k*»l 

Then,  from  the  gaussian  property  for  p(x^^/z  ~  ),  one  obtains 

•'*  [  v'*k'13  * E  IVk-i -  Vw  A*'1] 

t  t 

*  fk  xk-l  +  Bk  ELxk-l^z  1 


*  /2  a  2  x 

\/*-i  *  fk  xk-i  +  gk*pk-l  +  xk-l} 


(3.3) 


This  is  the  scalar  version  of  (2.13a).  It  can  be  obtained  without 

» 

recourse  to  the  general  Bayesian  relation  (2.7)  simply  by  computing 
the  conditional  expectation  of  x^  using  (3,1), 


The  error  in  this  estimate  is  given  by 


xk  *  xk/k-l  c  rk^xk-r^k-l^  +  ek^xk-l“xk-l^2  +  Xk- l"xk- 1  ^ Xk- l"pk- 1] 


Using  this  result,  the  variance  is 


+  w 


k«l 


xk“xkA-l 


)W-t] 


Df 


2 

pkA-l 


(V8*k  Vi>2  Pk-i  *  ^ 


'k/k- 


+  q 


2 

k-1  (3.4) 


Equation  (3,4)  is  the  scalar  version  of  (2.13b).  Note  that  the  only 

A  2  k  1 

approximation  of  and  ^k/k-1  *s  *****  *hc  *s  6aussian. 

The  nonlinearities  destroy  the  gaussian  character  of  the  density  however 

so  (3,3)  and  (3.4)  will  be  approximations. 


The  effect  of  tho  plant  nonlinearity  upon  the  density  can  be  studied  by 
determining  the  third  and  fourth  central  moments.  If  the  density 
p(Xj£/Zjj._1)  were  gaussian,  then  the  third  central  moment  would  be  identically 
zero  and  the  fourth  central  moment  would  be  Consider  what  the 

values  actually  become.  The  third  central  moment  is  found  to  be 
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®&Wk-t*  ^  -Is  2fk  pk-l  f3^fk+2skXk-l)2"gk2pk- J 


(3.5) 


and  the  fourth  central  oonent  is  * 


Efxk"xkA-l)  ^  ^  =  3pkA-l  +  10sk  pk-l  +  4gk*fk+2gk  xk-l*  pk-l  *3,6) 

From  (3*5)  it  is  clear  that  the  presence  of  the  nonlinear  coefficient 
gk  will  cause  the  third  central  moment  to  be  nonzero.  Similarly,  the 
gk  causes  the  fourth  central  moment  to  change  from  the  gaussian  value. 

The  changes  from  the  gaussian  moments  are  sfeen  to  be  proportional  to 

a  e 

Pk  j  in  (3.5)  and  p^_j  in  (3.6).  If  the  filter  converges  to  the  true 

value  of  the  state  as  the  number  of  samples  becomes-  large,  one  sees  then 

that  the  gaussian-destr  \>ying  terms  converge  to  zero  much  more  rapidly 

2 

than  does  the  variance  p^.  This  implies  that  the  gaussian  approximation 
will  improve  as  k  becomes  large.  . 

The  derivation  of  the  moments  for  the  prediction  density  is  not  difficult* 
This  is  not  the  case  for  the  moments  of  the  filtering  density.  Consider 

W  i 

for  a  moment  the  calculation  of  p(xk/z  )  from  (2.6).  .First,  it  is  clear 

k- 1  * 

that  knowledge  of  p(x./z  )  and  the  measurement  equation 


*k  =  hk  xk+ekxk+vk 


(3.7) 


b 

defines  p(x./z  )  without  error.  The  density  for  the  f/  is  assumed 

»  1  Tk  2  ■  "  k 

to  be  p(  v^  )  *  ky  exp  -  ^(~)  »  That  is, the  pfx^/z  ),  using  (2,6),  is 

formed  by  an  algebraic  combination  of  the  densities.  Unfortunately,  it 

is  not  an  easy  task  to  determine  the  moments  of  the  density  that  results. 

To  understand  this,  note  that 


p(zkAk)  =  kvexp  - 


(3.8) 


1 .  Ir 

Assuming  that  p{x^/z )  is  gaussian,  the  pCx^/z )  can  be  computed  from 
(2.6)  and  can  be  written  as  - 


*  1  2  1 

p(xkAK)  =  kN  exp-|  «*P  -  jK 


2  3  2  4 

-2e.  z.x!"+2h!e!x|'+c£x? 

(^jLOg-JL^JL^)  (3.0) 
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where 


1  "k  1 

-2*2*  2 


pkA-l 


2  V5 A‘k  Vk-1  % 

fk '  ^  <“  ♦  -H-  > 

.  k  pkA-l 


The  kN  is  s  quantity  not  involving  the  state  x^w  The1^  and  ^  are 
the  variance  and  span  that  is  obtained  when  the  system  is  linear.  Thus, 
the  measurement  nonlinearity  modifies  the  filtering  density  for  linear 
syst'^s  by  appearing  as  an  exponential  factor.  This  factor  contains 
the  fourth  power  of  the  state.  It  is  apparent  that  the  moments 
e[x£/z*J  cannot  be  computed  directly  from  (3.9)  because  of  the  nature 
of  the  second  exponential  factor.  Thus,  it  becomes  necessary  to  approximate 
the  moments  in  some  way.  This  problem  has  been  discussed  in  Reference  5. 

For  this  discussion  the  approximation  shall  be  accomplished  indirectly. 
First,  one  sees  that  the  moments  relating  to  the  measurement  (i.e. 

can  be  computed  in  the  same  way  as  was  done  for  the 
prediction  density.  Thus,  one  obtains 


A  .  A  /  2*2% 

*kA-l  *  Vfyk-l  +  VpkA-l  *  ’Vk-r 


(3.10) 


Vhen  a  measurement  z^  is  processed,  the  error  in  the  estimate 
is  » 

ek  *  VVk-l 


Then,  hypothesize  that  an  estimate  x.  is  obtained  by  adding  the  error 

*  A  * 

linearly  to  the  predicted  estimate 

*k  *  *kA-l  +  VVVk-1* 


(3.11) 
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The  gain  Matrix  shall  be  determined  so  that  the  Mean ‘square  error 

♦ 

•;  Et  .£*  E(Xjt-Xjc)/zk’*^|  is  minimized. 

t 

Note  the  averaging  is  restricted  to  the  Measurement  z^.  The  error  is 
found  to  be 

V*k  *  <V*kA-l)  •  ‘lthk(vW"k  ( V V*- 1 ,4*2 Vk- i(VVk-i * 


*pnA-»1 


♦ 


I 


E 


I, 


a 


f 


It  follows  that  the  that  acconplishes  the  minimization  is  given  by 

<hk+2*k  S/k-l^kA^l 

[(hk+2Sk  PkA-l+2Sk  pkA-l+rV)  (3.12) 

The  variance  is  approximated  by  £ ^  i«  found  to  be 


pk  *  pk/k-l  ”  Ik<hk+,**kxkA-l^pkA-l 


(3.13) 


Equations  (3.10)  -  (3.13)  are  the  scalar  versions  of  (2.13)  -  (2.14). 
Although  these  results  are  admittedly  obtained  in  a  suboptimal  fashion, 
it  is  worth  observing  that  this  procedure  yields  the  optimal  solution 
for  linear  systems  [l]  •  Note  that  the  estimate  is  stagewise  unbiased, 
that  is  ,  one  finds  that  E's  .\,x^}  is  equal  to  E^x^vhen  Etxk-l}  *  *k-l# 
Computational  results  obtained  from  these  filter  equations  are  encouraging 
and  suggest  that  the  approximation  is  reasonably  accurate  and  certainly 
is  an  improvement  over  a  purely  linear  approximation. 
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IV.  A  NUMERICAL  EXAMPLE 

* 

•  , 

In. this  section  a  simple  problem  is  considered  in  order  to  compare  the 
linear  and  nonlinear  filters.*  The  example  has  been  discussed  previously 
by  Denham  and  Pines  £ 2 ]  in  connection  with  the  Inadequacy  of  a  linear 

filter  when  the  nonlinearity  is  comparable  to  the  measurement  noise. 

•  \ 

«* 

Consider  a  scalar  system  wit*,  a  static  plant 

* 

and  measurements  described  by 
*kBXk  +  Tk 

The  statistics  for  the  initial  state  and  measurement  noise  are 
®[*J  *>  a  {  e[(xo  -  a)2]  -  m2 
Efvkl*0|  e[v2]  «  r'2 

e 

The  state  x^  will  be  estimated  by  using  perturbation  theory.  Then, , 
the  results  of  the  preceding  sections  will  be  applied  to.  estimate  the 
perturbations.  The  perturbation  equations  are  ' 

(4.3) 

2^xk  *rk  ,  (4.4) 

state  and 


£*k*  K-i  . 

£.k  «  2xk't  $xk  + 


where  x^j  is  the  nominal 
t  • 

K*  vxk 


(4.1) 


(4.2) 
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The  nominal  was  chosen  to  be 

i 

»  • 

x  *  a  * 

o 

and  this  value  was  retained  for  all  k. 

This  system  was  simulated  on  a.  digital  computer*  The  initial  perturbation 

Sx  and  the  measurement  noise  sequence  were  obtained  from  a  Gaussian 
0 

random  number  generator  according  to  the  prescribed  statistics*  Some 

typical  results  are  portrayed  in  Figures  1  and  2*  In  these  Figures  the 

standard  deviation  p..  for  the  nonlinear  filter  is  plotted  as  Is  the 

absolute  value  of  the  error  *or  the  linear  and  nonlineai*  filter* 

The  p^  for  the  nonlinear  filter  is  given  because  it'  is  somewhat  larger 

than  for  the  linear  filter  but  is  not  significantly  different*  In  Figure 

1  the  variance  of  the  initial  perturbation  is  0*01  and  this  was  increased 

to  0.1  for  the  results  shown  in  Figure  2.  The  comparison  of  pfc  and 

indicates  that  the  linear  filter  behaves  somewhat  satisfactorily 

for  the  smaller  initial  deviation  (i.e.  Figure  1)  although  it  seems  to 

be  diverging  as  the  number  of  samples  increases*  In  the  second  case  the 

linear  filter  is  obviously  inadequate  because  the  error  and  its  statistic 

are  in  complete  disagreement*  On  the  other  hand  t) e  nonlinear  filter 
« 

gives  significantly  better  results  and  suggests  that  a  great  deal  more 

confidence  can  be  placed  in  the  estimates.  These  results  are  typical 

of  those  that  have  been  obtained.  (Unfortunately,  the  data  from  a 

complete  Monte  Carlo  simulation  cannot  be  presented  at  this  time)* 

» 

The  numerical  results  that  have  been  obtained  suggest  several  conclusions* 

The  two  most  important  are  stated  below. 

• 

(1)  Unless  the  measurement  noise  is  "small",  the  linear  filter  and  the 
nonlinear  filter  give  essentially  the  same  response*  A  precise 
definition  of  "small"  shall  not  be  attempted  other  than  to  say  that 
the  noise  must  be  small  relative  to  the  second  order  effects. 

(2)  When  differences  between  the  linear  and  nonlinear  filters  do  arise, 
the  latter  gives  consistently  better  results  in  the  sense  that  the 
error  and  the  statistic  are  consistent  with  one  another. 
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?.  CONCLUSIONS 

The  problem  of  obtaining  estimates  of  the  state  of  a  nonlinear  system 
is  frequently  solved  by  applying  linear  perturbation  theory  and  using 
linear  estimation  theory  to  determine  the  perturbations*  While  this 
procedure  is  frequently  satisfactory,  many  examples  have  been  encountered 
in  which  second  order  effects  are  small  but  nonnegligible.  The  application 
of  linear  theory  to  nonlinear  problems  is  essentially  a  means  of 
approximating  the  mean  and  covariance  of  the  a  posteriori  density.  In 
this  paper  attention  has  been  directed  toward  the  development  of 
approximations  of  these  moments  that  include  the  influence  of  second 
order  terms.  The  approximation  involves  the  explicit  assumption  that 
the  density  is  gaussian.  This  is  not  true  for  noniinoar  system  but 
it  is  felt  that  the  first  two  moments  will  not  be  severly  affected  as 
long  as  the  nonlinearity  is  small.  Certainly,  one  would  expect  the 
accuracy  of  the  moments  containing  the  second  order  effects  to  be  superior 
to  the  linear  moments. 

•  # 

The  Immediate  disadvantage  of  utilizing  a  quadratic  perturbation  theory 
lies  in  the  increased  number  of  system  matrices  that  have  to  be  determined. 
For  li.near  systems  the  linear  transition  matrix  is  (nxn)  and  the  observation 
matrix  is(mxn).  When  the  second  order  effects  are  included,  there  are 
n  additional  (nxn)  plant  matrices  and  m  more  (nxn)  observation  matrices  to 
be  calculated.  The  increase  in  computational  requirements  can  be 
catastrophic  for  large  n.  This  problem  can  be  circumvented  to  an  extent 
by  establishing  the  equations  in  which  the  second  order  effects  are 

significant  and  neglecting  the'  insignificant  effects  in  the  other  equations. 

\ 

Although  these  results  may  require  significantly  more  calculation,  they 
do  provide  a  systematic  way  for  modifying  linear  estimation  theory  to 
include  nonlinear  effects.  Numerical  results  for  simple  examples  indicate 
that  significant  improvements  in  the  response  is  possible  when  the 
nonlinearities  are  comparable  to  the  noise.  Thus,  a  sizable  increase  in 
the  range  of  applicability  of  perturbative  techniques  may  be  provided. 
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Linear  estimation.  theory  has  been  applied  extensively  to  nonlinear 
systems  by  assuming  that  perturbations  from  a  reference  solution  can 
be  described  by  linear  equations.  As  long  as  the  second  order  (and 
higher)  terns  in  the  perturbation  equations  are  negligible,  linear 
estimation  techniques  have  been  found  to  yield  satisfactory  response. 

Many  examples  have  been  encountered  in  which  the  linear  theory  is  not 
satisfactory,  however,  and  it  is  to  this  situation  that  attention  is 
directed  here.  Timo-discrete  systems  in  which  the  second  order  effects 
are  small  but  nonnegligible  are  considered.  Recursion  relations  for 
the  conditional  moan  and' covariance  are  developed,  While  these  relations 
yield  approximations  to  the  true  values  of  these  moments,  they  are 
'  superior  to  the  approximations  provided  by  applying  linear  theory  to 
a  nonlinear  system.  Some  results  for  a  simple  system  arc  presented 
in  which  the  response  from  linear  and  nonlinear  filters  is  compared. 
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